A Novel Method of Data Stream Clustering Based on Wavelet Timing Series Tree Synopsis
نویسندگان
چکیده
For the difficulty of obtaining cluster result fast and effectively under the limitations of bounded memory and time, this paper proposes a novel data stream clustering method based on wavelet timing series tree synopsis to solve the problem. The proposed method considers the attenuation characteristic of data stream, which combines the dynamic maintenance of wavelet coefficient and attenuation feature of wavelet coefficients of data stream, and can achieve approximate representation of data stream fragment information and dynamic maintenance of its synopsis structure. The proposed method employs wavelet timing series tree synopsis method to compress data stream fragment, then adopts two-phase density clustering algorithm to cluster. Detailed experiments show that the proposed method can get high compression quality, good space and time efficiency and good clustering results.
منابع مشابه
A novel spatial clustering method based on wavelet network and density analysis for data stream
With the limited memory and time, a fast and effective clustering can’t be achieved for massive, highspeed data stream, so this paper mainly studies the key method of data stream clustering under the restriction of resource, and then proposes a dynamic data stream clustering algorithm (D-DStream) based on wavelet network and density, which uses sliding window to process data stream. Firstly, ap...
متن کاملSignal processing approaches as novel tools for the clustering of N-acetyl-β-D-glucosaminidases
Nowadays, the clustering of proteins and enzymes in particular, are one of the most popular topics in bioinformatics. Increasing number of chitinase genes from different organisms and their sequences have beenidentified. So far, various mathematical algorithms for the clustering of chitinase genes have been used butmost of them seem to be confusing and sometimes insufficient. In the...
متن کاملA novel hybrid method for vocal fold pathology diagnosis based on russian language
In this paper, first, an initial feature vector for vocal fold pathology diagnosis is proposed. Then, for optimizing the initial feature vector, a genetic algorithm is proposed. Some experiments are carried out for evaluating and comparing the classification accuracies which are obtained by the use of the different classifiers (ensemble of decision tree, discriminant analysis and K-nearest neig...
متن کاملAn Empirical Comparison of Distance Measures for Multivariate Time Series Clustering
Multivariate time series (MTS) data are ubiquitous in science and daily life, and how to measure their similarity is a core part of MTS analyzing process. Many of the research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques to estimate similarity between MTS, this field suffers from a lack of comparative...
متن کاملCombination of Transformed-means Clustering and Neural Networks for Short-Term Solar Radiation Forecasting
In order to provide an efficient conversion and utilization of solar power, solar radiation datashould be measured continuously and accurately over the long-term period. However, the measurement ofsolar radiation is not available to all countries in the world due to some technical and fiscal limitations. Hence,several studies were proposed in the literature to find mathematical and physical mod...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013